Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 115 |
| Missing cells | 32 |
| Missing cells (%) | 2.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 10.9 KiB |
| Average record size in memory | 97.1 B |
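The percentages in this table follow directly from the raw counts. A minimal sketch of that arithmetic, using only figures reported in this section (the variable names are illustrative and not part of pandas-profiling):

```python
# Figures copied from the Dataset statistics table.
n_variables = 12
n_observations = 115
missing_cells = 32
total_size_bytes = 10.9 * 1024  # 10.9 KiB, as rounded in the report

# Share of missing cells over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Approximate average record size. The report computed its figure from
# the unrounded total, so this only agrees to about one decimal place.
avg_record_bytes = total_size_bytes / n_observations

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
print(f"about {avg_record_bytes:.1f} B per record")
```

The 32 missing cells also match the per-variable warnings: four variables with 6 missing values, two with 2, and one with 4.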
Variable types
| NUM | 10 |
|---|---|
| CAT | 2 |
Warnings
| Warning | Type |
|---|---|
| year_month has a high cardinality: 63 distinct values | High cardinality |
| transfer_value_gbp is highly correlated with transfer_value_eur and 6 other fields | High correlation |
| transfer_value_eur is highly correlated with transfer_value_gbp and 6 other fields | High correlation |
| transfer_value_inr is highly correlated with transfer_value_eur and 6 other fields | High correlation |
| revenue_value_eur is highly correlated with transfer_value_eur and 6 other fields | High correlation |
| revenue_value_gbp is highly correlated with transfer_value_eur and 6 other fields | High correlation |
| revenue_value_inr is highly correlated with transfer_value_eur and 6 other fields | High correlation |
| transfers is highly correlated with transfer_value_eur and 6 other fields | High correlation |
| new_users is highly correlated with transfer_value_eur and 6 other fields | High correlation |
| transfer_value_gbp has 6 (5.2%) missing values | Missing |
| transfer_value_inr has 6 (5.2%) missing values | Missing |
| revenue_value_gbp has 6 (5.2%) missing values | Missing |
| revenue_value_inr has 6 (5.2%) missing values | Missing |
| new_users has 2 (1.7%) missing values | Missing |
| users has 2 (1.7%) missing values | Missing |
| activer_user_rate has 4 (3.5%) missing values | Missing |
| year_month is uniformly distributed | Uniform |
| transfer_value_eur has unique values | Unique |
| revenue_value_eur has unique values | Unique |
Reproduction
| Analysis started | 2021-03-14 12:47:39.200397 |
|---|---|
| Analysis finished | 2021-03-14 12:48:22.311143 |
| Duration | 43.11 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
year_month
Categorical
| Distinct | 63 |
|---|---|
| Distinct (%) | 54.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 920.0 B |
| Value | Count |
|---|---|
| 2015-10 | 2 |
| 2015-06 | 2 |
| 2014-10 | 2 |
| 2016-02 | 2 |
| 2015-11 | 2 |
| Other values (58) | 105 |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2015-10 | 2 | 1.7% | |
| 2015-06 | 2 | 1.7% | |
| 2014-10 | 2 | 1.7% | |
| 2016-02 | 2 | 1.7% | |
| 2015-11 | 2 | 1.7% | |
| 2015-01 | 2 | 1.7% | |
| 2018-01 | 2 | 1.7% | |
| 2016-09 | 2 | 1.7% | |
| 2018-05 | 2 | 1.7% | |
| 2015-07 | 2 | 1.7% | |
| Other values (53) | 95 | 82.6% |
Frequencies of value counts
Unique
| Unique | 11 |
|---|---|
| Unique (%) | 9.6% |
Histogram of lengths of the category
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
transfer_type
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 920.0 B |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Personal | 63 | 54.8% | |
| Business | 52 | 45.2% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
transfer_value_eur
Numeric
| Distinct | 115 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4746857.113 |
| Minimum | 9184 |
| Maximum | 22006935 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
Quantile statistics
| Minimum | 9184 |
|---|---|
| 5-th percentile | 116695.4 |
| Q1 | 836308 |
| median | 2821555 |
| Q3 | 6415206.5 |
| 95-th percentile | 15697950.5 |
| Maximum | 22006935 |
| Range | 21997751 |
| Interquartile range (IQR) | 5578898.5 |
Descriptive statistics
| Standard deviation | 5220158.764 |
|---|---|
| Coefficient of variation (CV) | 1.099708426 |
| Kurtosis | 1.55856614 |
| Mean | 4746857.113 |
| Median Absolute Deviation (MAD) | 2452103 |
| Skewness | 1.469592552 |
| Sum | 545888568 |
| Variance | 2.725005752e+13 |
| Monotonicity | Not monotonic |
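Several of the figures above are derived from one another, which gives a quick way to sanity-check the tables. The sketch below recomputes them from values copied out of this section. The variable names are illustrative, and the count of 115 assumes no missing values, as the overview for this variable reports.

```python
import math

# Figures copied from the overview, quantile and descriptive tables.
minimum, maximum = 9184, 22006935
q1, q3 = 836308, 6415206.5
mean, std = 4746857.113, 5220158.764
total, n = 545888568, 115

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance
mean_from_sum = total / n         # Mean recomputed from Sum

print(value_range, iqr)
print(round(cv, 6), f"{variance:.6e}", round(mean_from_sum, 3))

# Each recomputed value should agree with the reported one up to rounding.
assert math.isclose(cv, 1.099708426, rel_tol=1e-6)
```

A CV above 1 together with a mean well above the median (4746857.113 versus 2821555) is consistent with the positive skewness reported in the same table.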
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 5448192 | 1 | 0.9% | |
| 11596522 | 1 | 0.9% | |
| 2821555 | 1 | 0.9% | |
| 10355121 | 1 | 0.9% | |
| 778672 | 1 | 0.9% | |
| 251823 | 1 | 0.9% | |
| 845911 | 1 | 0.9% | |
| 2375083 | 1 | 0.9% | |
| 1262250 | 1 | 0.9% | |
| 18413529 | 1 | 0.9% | |
| Other values (105) | 105 | 91.3% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 9184 | 1 | 0.9% | |
| 10841 | 1 | 0.9% | |
| 21132 | 1 | 0.9% | |
| 84507 | 1 | 0.9% | |
| 93305 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 22006935 | 1 | 0.9% | |
| 21638933 | 1 | 0.9% | |
| 18967356 | 1 | 0.9% | |
| 18413529 | 1 | 0.9% | |
| 17230797 | 1 | 0.9% |
transfer_value_gbp
Numeric
| Distinct | 109 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 6 |
| Missing (%) | 5.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5846470.091 |
| Minimum | 11636.91085 |
| Maximum | 24529384.83 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
Quantile statistics
| Minimum | 11636.91085 |
|---|---|
| 5-th percentile | 228693.2376 |
| Q1 | 1243789.251 |
| median | 3852180.708 |
| Q3 | 8398004.041 |
| 95-th percentile | 18449571.54 |
| Maximum | 24529384.83 |
| Range | 24517747.92 |
| Interquartile range (IQR) | 7154214.79 |
Descriptive statistics
| Standard deviation | 5898512.235 |
|---|---|
| Coefficient of variation (CV) | 1.008901464 |
| Kurtosis | 1.153747523 |
| Mean | 5846470.091 |
| Median Absolute Deviation (MAD) | 2886438.416 |
| Skewness | 1.333987807 |
| Sum | 637265239.9 |
| Variance | 3.479244659e+13 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 8398004.041 | 1 | 0.9% | |
| 2605233.241 | 1 | 0.9% | |
| 355162.9976 | 1 | 0.9% | |
| 1355941.108 | 1 | 0.9% | |
| 638713.7376 | 1 | 0.9% | |
| 1082385.664 | 1 | 0.9% | |
| 478372.4389 | 1 | 0.9% | |
| 8615640.876 | 1 | 0.9% | |
| 4077663.199 | 1 | 0.9% | |
| 11162009.1 | 1 | 0.9% | |
| Other values (99) | 99 | 86.1% | |
| (Missing) | 6 | 5.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 11636.91085 | 1 | 0.9% | |
| 13634.29553 | 1 | 0.9% | |
| 152348.6922 | 1 | 0.9% | |
| 154722.6851 | 1 | 0.9% | |
| 192080.289 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 24529384.83 | 1 | 0.9% | |
| 24212700.59 | 1 | 0.9% | |
| 21156207.46 | 1 | 0.9% | |
| 20863089.04 | 1 | 0.9% | |
| 19438138.54 | 1 | 0.9% |
transfer_value_inr
Numeric
| Distinct | 109 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 6 |
| Missing (%) | 5.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 527869724.9 |
| Minimum | 1141891.909 |
| Maximum | 2272727262 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
Quantile statistics
| Minimum | 1141891.909 |
|---|---|
| 5-th percentile | 22716200.49 |
| Q1 | 111967499 |
| median | 341325031.9 |
| Q3 | 805031078.5 |
| 95-th percentile | 1678703748 |
| Maximum | 2272727262 |
| Range | 2271585370 |
| Interquartile range (IQR) | 693063579.4 |
Descriptive statistics
| Standard deviation | 531685052.9 |
|---|---|
| Coefficient of variation (CV) | 1.007227783 |
| Kurtosis | 1.499962486 |
| Mean | 527869724.9 |
| Median Absolute Deviation (MAD) | 259244705.3 |
| Skewness | 1.402695249 |
| Sum | 5.753780001e+10 |
| Variance | 2.826889954e+17 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 621282899.3 | 1 | 0.9% | |
| 997471450.4 | 1 | 0.9% | |
| 103276278.4 | 1 | 0.9% | |
| 35204541.61 | 1 | 0.9% | |
| 229332428.5 | 1 | 0.9% | |
| 123274516.1 | 1 | 0.9% | |
| 607535109 | 1 | 0.9% | |
| 1554987569 | 1 | 0.9% | |
| 1789913891 | 1 | 0.9% | |
| 170080765 | 1 | 0.9% | |
| Other values (99) | 99 | 86.1% | |
| (Missing) | 6 | 5.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1141891.909 | 1 | 0.9% | |
| 1397965.334 | 1 | 0.9% | |
| 14508713.62 | 1 | 0.9% | |
| 14601528.86 | 1 | 0.9% | |
| 18803080.69 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2272727262 | 1 | 0.9% | |
| 2205626360 | 1 | 0.9% | |
| 1999286345 | 1 | 0.9% | |
| 1896025951 | 1 | 0.9% | |
| 1789913891 | 1 | 0.9% |
revenue_value_eur
Numeric
| Distinct | 115 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41420.08889 |
| Minimum | 81.7376 |
| Maximum | 191460.3345 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
Quantile statistics
| Minimum | 81.7376 |
|---|---|
| 5-th percentile | 1038.58906 |
| Q1 | 7358.5501 |
| median | 24547.5285 |
| Q3 | 56453.99515 |
| 95-th percentile | 136401.0505 |
| Maximum | 191460.3345 |
| Range | 191378.5969 |
| Interquartile range (IQR) | 49095.44505 |
Descriptive statistics
| Standard deviation | 45332.63639 |
|---|---|
| Coefficient of variation (CV) | 1.094460142 |
| Kurtosis | 1.538064474 |
| Mean | 41420.08889 |
| Median Absolute Deviation (MAD) | 21259.4057 |
| Skewness | 1.461301193 |
| Sum | 4763310.223 |
| Variance | 2055047922 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 10981.575 | 1 | 0.9% | |
| 131712.0492 | 1 | 0.9% | |
| 51639.8826 | 1 | 0.9% | |
| 45488.5497 | 1 | 0.9% | |
| 63992.7195 | 1 | 0.9% | |
| 6825.5168 | 1 | 0.9% | |
| 2554.6026 | 1 | 0.9% | |
| 100889.7414 | 1 | 0.9% | |
| 81.7376 | 1 | 0.9% | |
| 131866.0131 | 1 | 0.9% | |
| Other values (105) | 105 | 91.3% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 81.7376 | 1 | 0.9% | |
| 96.4849 | 1 | 0.9% | |
| 190.188 | 1 | 0.9% | |
| 760.563 | 1 | 0.9% | |
| 839.745 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 191460.3345 | 1 | 0.9% | |
| 188258.7171 | 1 | 0.9% | |
| 163119.2616 | 1 | 0.9% | |
| 160197.7023 | 1 | 0.9% | |
| 149907.9339 | 1 | 0.9% |
revenue_value_gbp
Numeric
| Distinct | 109 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 6 |
| Missing (%) | 5.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51040.12348 |
| Minimum | 103.5685066 |
| Maximum | 213405.648 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
Quantile statistics
| Minimum | 103.5685066 |
|---|---|
| 5-th percentile | 2035.369815 |
| Q1 | 10820.96648 |
| median | 33513.97216 |
| Q3 | 74742.23597 |
| 95-th percentile | 160125.5169 |
| Maximum | 213405.648 |
| Range | 213302.0795 |
| Interquartile range (IQR) | 63921.26948 |
Descriptive statistics
| Standard deviation | 51230.728 |
|---|---|
| Coefficient of variation (CV) | 1.003734405 |
| Kurtosis | 1.128386431 |
| Mean | 51040.12348 |
| Median Absolute Deviation (MAD) | 24918.86576 |
| Skewness | 1.323510669 |
| Sum | 5563373.459 |
| Variance | 2624587491 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 4540.932047 | 1 | 0.9% | |
| 23279.57247 | 1 | 0.9% | |
| 128264.1616 | 1 | 0.9% | |
| 121.3452302 | 1 | 0.9% | |
| 3160.950679 | 1 | 0.9% | |
| 10758.29886 | 1 | 0.9% | |
| 108190.5139 | 1 | 0.9% | |
| 181943.3842 | 1 | 0.9% | |
| 7610.672328 | 1 | 0.9% | |
| 11169.6819 | 1 | 0.9% | |
| Other values (99) | 99 | 86.1% | |
| (Missing) | 6 | 5.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 103.5685066 | 1 | 0.9% | |
| 121.3452302 | 1 | 0.9% | |
| 1355.903361 | 1 | 0.9% | |
| 1377.031897 | 1 | 0.9% | |
| 1709.514572 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 213405.648 | 1 | 0.9% | |
| 210650.4951 | 1 | 0.9% | |
| 181943.3842 | 1 | 0.9% | |
| 181508.8747 | 1 | 0.9% | |
| 169111.8053 | 1 | 0.9% |
revenue_value_inr
Numeric
| Distinct | 109 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 6 |
| Missing (%) | 5.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4609996.117 |
| Minimum | 10162.83799 |
| Maximum | 19772727.18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
Quantile statistics
| Minimum | 10162.83799 |
|---|---|
| 5-th percentile | 202174.1844 |
| Q1 | 984720.7318 |
| median | 2969527.777 |
| Q3 | 7003770.383 |
| 95-th percentile | 14604722.61 |
| Maximum | 19772727.18 |
| Range | 19762564.34 |
| Interquartile range (IQR) | 6019049.651 |
Descriptive statistics
| Standard deviation | 4619333.356 |
|---|---|
| Coefficient of variation (CV) | 1.002025433 |
| Kurtosis | 1.468881458 |
| Mean | 4609996.117 |
| Median Absolute Deviation (MAD) | 2255428.936 |
| Skewness | 1.390854774 |
| Sum | 502489576.7 |
| Variance | 2.133824065e+13 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 5076115.512 | 1 | 0.9% | |
| 978241.0702 | 1 | 0.9% | |
| 2969527.777 | 1 | 0.9% | |
| 15514235.86 | 1 | 0.9% | |
| 417117.8408 | 1 | 0.9% | |
| 9697233.566 | 1 | 0.9% | |
| 5405161.224 | 1 | 0.9% | |
| 4165300.917 | 1 | 0.9% | |
| 3365083.998 | 1 | 0.9% | |
| 1864004.113 | 1 | 0.9% | |
| Other values (99) | 99 | 86.1% | |
| (Missing) | 6 | 5.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10162.83799 | 1 | 0.9% | |
| 12441.89147 | 1 | 0.9% | |
| 129127.5512 | 1 | 0.9% | |
| 129953.6069 | 1 | 0.9% | |
| 167347.4181 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 19772727.18 | 1 | 0.9% | |
| 19188949.33 | 1 | 0.9% | |
| 17393791.2 | 1 | 0.9% | |
| 16305823.18 | 1 | 0.9% | |
| 15514235.86 | 1 | 0.9% |
transfers
Numeric
| Distinct | 114 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3907.626087 |
| Minimum | 2 |
| Maximum | 22664 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 56.7 |
| Q1 | 252.5 |
| median | 841 |
| Q3 | 5584 |
| 95-th percentile | 16083.3 |
| Maximum | 22664 |
| Range | 22662 |
| Interquartile range (IQR) | 5331.5 |
Descriptive statistics
| Standard deviation | 5562.196624 |
|---|---|
| Coefficient of variation (CV) | 1.423420896 |
| Kurtosis | 2.016616615 |
| Mean | 3907.626087 |
| Median Absolute Deviation (MAD) | 773 |
| Skewness | 1.67725933 |
| Sum | 449377 |
| Variance | 30938031.29 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 61 | 2 | 1.7% | |
| 257 | 1 | 0.9% | |
| 438 | 1 | 0.9% | |
| 5045 | 1 | 0.9% | |
| 11187 | 1 | 0.9% | |
| 5554 | 1 | 0.9% | |
| 7089 | 1 | 0.9% | |
| 687 | 1 | 0.9% | |
| 683 | 1 | 0.9% | |
| 8109 | 1 | 0.9% | |
| Other values (104) | 104 | 90.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1 | 0.9% | |
| 3 | 1 | 0.9% | |
| 14 | 1 | 0.9% | |
| 32 | 1 | 0.9% | |
| 33 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 22664 | 1 | 0.9% | |
| 22341 | 1 | 0.9% | |
| 19628 | 1 | 0.9% | |
| 19049 | 1 | 0.9% | |
| 17625 | 1 | 0.9% |
new_users
Numeric
| Distinct | 90 |
|---|---|
| Distinct (%) | 79.6% |
| Missing | 2 |
| Missing (%) | 1.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 249.2831858 |
| Minimum | 6 |
| Maximum | 887 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
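The distinct percentage above is not 90 out of 115. It appears to be computed over non-missing observations only. This is an inference from the reported figures rather than something the report states, and the helper below is hypothetical.

```python
N_OBSERVATIONS = 115  # from the Dataset statistics table

def distinct_pct(distinct, missing, n=N_OBSERVATIONS):
    """Distinct values as a percentage of non-missing observations."""
    return 100 * distinct / (n - missing)

# 90 distinct values with 2 missing: denominator is 113, not 115.
print(round(distinct_pct(90, 2), 1))
# Ignoring the missing values would give a different figure.
print(round(100 * 90 / N_OBSERVATIONS, 1))
```

The same rule reproduces the other overview tables in this report, for example 63 distinct values with 2 missing, or 63 distinct values with none missing.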
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 11.6 |
| Q1 | 26 |
| median | 81 |
| Q3 | 457 |
| 95-th percentile | 765.2 |
| Maximum | 887 |
| Range | 881 |
| Interquartile range (IQR) | 431 |
Descriptive statistics
| Standard deviation | 272.1527102 |
|---|---|
| Coefficient of variation (CV) | 1.091741143 |
| Kurtosis | -0.5732599668 |
| Mean | 249.2831858 |
| Median Absolute Deviation (MAD) | 72 |
| Skewness | 0.8817225764 |
| Sum | 28169 |
| Variance | 74067.09766 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 4 | 3.5% | |
| 26 | 4 | 3.5% | |
| 17 | 3 | 2.6% | |
| 33 | 2 | 1.7% | |
| 24 | 2 | 1.7% | |
| 22 | 2 | 1.7% | |
| 566 | 2 | 1.7% | |
| 42 | 2 | 1.7% | |
| 325 | 2 | 1.7% | |
| 46 | 2 | 1.7% | |
| Other values (80) | 88 | 76.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 2 | 1.7% | |
| 8 | 1 | 0.9% | |
| 9 | 1 | 0.9% | |
| 10 | 1 | 0.9% | |
| 11 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 887 | 1 | 0.9% | |
| 877 | 1 | 0.9% | |
| 874 | 1 | 0.9% | |
| 865 | 1 | 0.9% | |
| 825 | 1 | 0.9% |
users
Numeric
| Distinct | 63 |
|---|---|
| Distinct (%) | 55.8% |
| Missing | 2 |
| Missing (%) | 1.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10911.37168 |
| Minimum | 6 |
| Maximum | 28169 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 236.6 |
| Q1 | 3561 |
| median | 9350 |
| Q3 | 16828 |
| 95-th percentile | 26027 |
| Maximum | 28169 |
| Range | 28163 |
| Interquartile range (IQR) | 13267 |
Descriptive statistics
| Standard deviation | 8270.968743 |
|---|---|
| Coefficient of variation (CV) | 0.7580136563 |
| Kurtosis | -0.9005422056 |
| Mean | 10911.37168 |
| Median Absolute Deviation (MAD) | 6509 |
| Skewness | 0.4929948623 |
| Sum | 1232985 |
| Variance | 68408923.95 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 20746 | 2 | 1.7% | |
| 13637 | 2 | 1.7% | |
| 3182 | 2 | 1.7% | |
| 24730 | 2 | 1.7% | |
| 14945 | 2 | 1.7% | |
| 6408 | 2 | 1.7% | |
| 2381 | 2 | 1.7% | |
| 7361 | 2 | 1.7% | |
| 3561 | 2 | 1.7% | |
| 15619 | 2 | 1.7% | |
| Other values (53) | 93 | 80.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 1 | 0.9% | |
| 18 | 1 | 0.9% | |
| 44 | 1 | 0.9% | |
| 82 | 1 | 0.9% | |
| 130 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 28169 | 2 | 1.7% | |
| 27347 | 2 | 1.7% | |
| 26579 | 2 | 1.7% | |
| 25659 | 2 | 1.7% | |
| 24730 | 2 | 1.7% |
activer_user_rate
Numeric
| Distinct | 111 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4 |
| Missing (%) | 3.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.84157042 |
| Minimum | 0.8691674291 |
| Maximum | 416.6666667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 920.0 B |
Quantile statistics
| Minimum | 0.8691674291 |
|---|---|
| 5-th percentile | 1.198918514 |
| Q1 | 1.632483811 |
| median | 30.89187114 |
| Q3 | 35.91640718 |
| 95-th percentile | 89.37060033 |
| Maximum | 416.6666667 |
| Range | 415.7974992 |
| Interquartile range (IQR) | 34.28392337 |
Descriptive statistics
| Standard deviation | 49.93757402 |
|---|---|
| Coefficient of variation (CV) | 1.619164438 |
| Kurtosis | 34.21563965 |
| Mean | 30.84157042 |
| Median Absolute Deviation (MAD) | 29.02841188 |
| Skewness | 5.028845992 |
| Sum | 3423.414316 |
| Variance | 2493.761299 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 46.70018856 | 1 | 0.9% | |
| 2.244844688 | 1 | 0.9% | |
| 1.833660773 | 1 | 0.9% | |
| 89.89361702 | 1 | 0.9% | |
| 1.64797914 | 1 | 0.9% | |
| 1.369439868 | 1 | 0.9% | |
| 1.774583853 | 1 | 0.9% | |
| 1.21927237 | 1 | 0.9% | |
| 35.30926423 | 1 | 0.9% | |
| 1.568813882 | 1 | 0.9% | |
| Other values (101) | 101 | 87.8% | |
| (Missing) | 4 | 3.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.8691674291 | 1 | 0.9% | |
| 1.025115325 | 1 | 0.9% | |
| 1.095197978 | 1 | 0.9% | |
| 1.156515035 | 1 | 0.9% | |
| 1.166909773 | 1 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 416.6666667 | 1 | 0.9% | |
| 233.3333333 | 1 | 0.9% | |
| 150 | 1 | 0.9% | |
| 101.5384615 | 1 | 0.9% | |
| 93.90243902 | 1 | 0.9% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
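The calculation described above can be sketched in a few lines of plain Python. This is an illustration of the formula only, not the code pandas-profiling runs, and the sample data is made up.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator.
    return cov / sqrt(var_x * var_y)

# Illustrative data, not taken from the report.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]
print(pearson_r(x, y))
# Shifting y and rescaling it by a positive factor leaves r unchanged.
print(pearson_r(x, [10 + 3 * v for v in y]))
```

The invariance shown in the last line is one reason the currency-converted columns in this dataset (EUR, GBP, INR) are flagged as highly correlated with each other.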
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
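As a rough sketch of that recipe: rank each variable, then apply the Pearson formula to the ranks. The helper names are hypothetical, ties receive the average of their positions (a common convention, assumed here), and the data is made up.

```python
from math import sqrt

def average_ranks(values):
    """Rank values 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson correlation computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(vx * vy)

# A cubic relationship is monotonic but not linear.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y))
```

Because only the ordering matters, any strictly increasing transformation of either variable leaves ρ unchanged.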
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
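The pair-counting definition above corresponds to the variant usually called tau-a. A minimal sketch follows, with made-up data and a hypothetical function name. Tied pairs count as neither concordant nor discordant here, whereas statistical libraries often default to the tie-corrected tau-b, so results can differ when ties are present.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
    n = len(x)
    return (concordant - discordant) / (n * (n - 1) / 2)

# Six pairs in total; only the middle two observations are out of order.
print(kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4]))
```

This direct approach compares every pair and so scales quadratically with the number of observations, which is fine for a dataset of 115 rows.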
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for Phik.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | year_month | transfer_type | transfer_value_eur | transfer_value_gbp | transfer_value_inr | revenue_value_eur | revenue_value_gbp | revenue_value_inr | transfers | new_users | users | activer_user_rate |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2018-12 | Business | 5343580 | 5.959794e+06 | 5.340085e+08 | 45954.7880 | 51254.230706 | 4.592473e+06 | 1076 | 46.0 | 28169.0 | 1.660146 |
| 1 | 2018-12 | Personal | 18967356 | 2.115621e+07 | 1.896026e+09 | 163119.2616 | 181943.384193 | 1.630582e+07 | 19628 | 776.0 | 28169.0 | 30.891871 |
| 2 | 2018-11 | Business | 4850567 | 5.513850e+06 | 5.129096e+08 | 42053.0553 | 47804.915159 | 4.447388e+06 | 981 | 38.0 | 27347.0 | 1.670492 |
| 3 | 2018-11 | Personal | 16960138 | 1.926924e+07 | 1.789914e+09 | 146982.8044 | 166999.490607 | 1.551424e+07 | 17533 | 730.0 | 27347.0 | 30.050040 |
| 4 | 2018-10 | Business | 5658745 | 6.411979e+06 | 6.145070e+08 | 49231.0815 | 55784.221350 | 5.346211e+06 | 1122 | 55.0 | 26579.0 | 1.769360 |
| 5 | 2018-10 | Personal | 18413529 | 2.086309e+07 | 1.999286e+09 | 160197.7023 | 181508.874666 | 1.739379e+07 | 19049 | 865.0 | 26579.0 | 32.425270 |
| 6 | 2018-09 | Business | 5918341 | 6.620784e+06 | 6.212829e+08 | 51489.5667 | 57600.818972 | 5.405161e+06 | 1192 | 42.0 | 25659.0 | 1.953093 |
| 7 | 2018-09 | Personal | 21638933 | 2.421270e+07 | 2.272727e+09 | 188258.7171 | 210650.495102 | 1.977273e+07 | 22341 | 887.0 | 25659.0 | 35.960372 |
| 8 | 2018-08 | Business | 5994491 | 6.686518e+06 | 6.005697e+08 | 52152.0717 | 58172.708498 | 5.224957e+06 | 1207 | 55.0 | 24730.0 | 1.911684 |
| 9 | 2018-08 | Personal | 22006935 | 2.452938e+07 | 2.205626e+09 | 191460.3345 | 213405.648010 | 1.918895e+07 | 22664 | 874.0 | 24730.0 | 35.872442 |
Last rows
| | year_month | transfer_type | transfer_value_eur | transfer_value_gbp | transfer_value_inr | revenue_value_eur | revenue_value_gbp | revenue_value_inr | transfers | new_users | users | activer_user_rate |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 105 | 2014-07 | Personal | 766912 | 965742.291361 | 9.894196e+07 | 6825.5168 | 8595.106393 | 880583.404020 | 549 | 149.0 | 636.0 | 66.324435 |
| 106 | 2014-06 | Personal | 689455 | 855131.722216 | 8.609865e+07 | 6136.1495 | 7610.672328 | 766277.964540 | 490 | 129.0 | 487.0 | 84.357542 |
| 107 | 2014-05 | Personal | 521607 | 638713.737560 | 6.400720e+07 | 4642.3023 | 5684.552264 | 569664.117578 | 368 | 89.0 | 358.0 | 88.847584 |
| 108 | 2014-04 | Personal | 378156 | 258461.361281 | 2.625997e+07 | 3365.5884 | 2300.306115 | 233713.770392 | 271 | 81.0 | 269.0 | 89.893617 |
| 109 | 2014-03 | Personal | 287034 | NaN | NaN | 2554.6026 | NaN | NaN | 208 | 58.0 | 188.0 | 101.538462 |
| 110 | 2014-02 | Personal | 159706 | NaN | NaN | 1421.3834 | NaN | NaN | 110 | 48.0 | 130.0 | 93.902439 |
| 111 | 2014-01 | Personal | 158711 | NaN | NaN | 1412.5279 | NaN | NaN | 114 | 38.0 | 82.0 | 150.000000 |
| 112 | 2013-12 | Personal | 93305 | NaN | NaN | 839.7450 | NaN | NaN | 68 | 26.0 | 44.0 | 233.333333 |
| 113 | 2013-11 | Personal | 84507 | NaN | NaN | 760.5630 | NaN | NaN | 60 | 12.0 | 18.0 | 416.666667 |
| 114 | 2013-10 | Personal | 21132 | NaN | NaN | 190.1880 | NaN | NaN | 14 | 6.0 | 6.0 | NaN |